N=1:

rank | frequency | n-gram |
---|---|---|
1 | 14008 | -o |
2 | 13088 | -e |
3 | 11024 | -a |
4 | 10057 | -i |
5 | 1419 | -” |

N=2:

rank | frequency | n-gram |
---|---|---|
1 | 3582 | -to |
2 | 2956 | -re |
3 | 2678 | -no |
4 | 2631 | -te |
5 | 2572 | -ti |

N=3:

rank | frequency | n-gram |
---|---|---|
1 | 1798 | -one |
2 | 1472 | -ato |
3 | 1339 | -are |
4 | 1186 | -nte |
5 | 1182 | -ano |

N=4:

rank | frequency | n-gram |
---|---|---|
1 | 1499 | -ione |
2 | 822 | -ente |
3 | 649 | -ento |
4 | 513 | -ando |
5 | 468 | -ioni |

N=5:

rank | frequency | n-gram |
---|---|---|
1 | 1178 | -zione |
2 | 535 | -mento |
3 | 485 | -mente |
4 | 342 | -zioni |
5 | 218 | -sione |

The tables above show, in order from N=1 to N=5, the five most frequent letter N-grams at the end of words. The procedure parallels 2.2.5 Most frequent word beginnings; the aim here is suffix detection rather than prefix detection.
For N=3:
SELECT @pos:=(@pos+1), xx.* FROM (SELECT @pos:=0) r, (SELECT COUNT(*) AS cnt, CONCAT("-", RIGHT(word,3)) FROM words WHERE w_id>100 GROUP BY RIGHT(word,3) ORDER BY cnt DESC) xx LIMIT 5;
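
The same counting step can be sketched outside the database for any N. The following is a minimal Python illustration, not the code used to produce the tables. The function name `top_word_endings` and the toy word list are hypothetical. The `w_id>100` filter from the query is not reproduced, so the sketch assumes the caller passes only the words that should be counted. As with `RIGHT(word,3)`, a word shorter than N contributes the whole word.

```python
from collections import Counter


def top_word_endings(words, n, limit=5):
    """Return (rank, frequency, n-gram) rows for the most frequent word endings.

    Hypothetical helper: mirrors the GROUP BY RIGHT(word, n) ... ORDER BY cnt DESC
    step of the query, without the w_id filter.
    """
    # word[-n:] yields the last n letters, or the whole word if it is shorter,
    # which matches the behaviour of RIGHT(word, n).
    counts = Counter(word[-n:] for word in words if word)
    return [
        (rank, freq, "-" + ending)
        for rank, (ending, freq) in enumerate(counts.most_common(limit), start=1)
    ]


if __name__ == "__main__":
    # Toy input for illustration only; real input would be the corpus word list.
    sample_words = ["nazione", "stazione", "azione", "momento"]
    for n in range(1, 6):
        print(n, top_word_endings(sample_words, n, limit=2))
```

Ties are returned in first-seen order here, whereas the SQL query leaves the order of equal counts unspecified, so rows with equal frequency may appear in a different order.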